Efficient non-uniform time-scaling of speech with WSOLA for CALL applications
نویسنده
چکیده
We consider the applicability of time-scaling for Computer Assisted Language Learning Applications (CALL) and present an efficient algorithm for non-uniform time-scaling. Formal listening tests show a general preference for this non-uniform time-scaling and indicate a dependence of this preference on such factors as the length of the utterance and the desired amount of time-scaling.
منابع مشابه
Overlap-add methods for time-scaling of speech
In this tutorial on time scaling we follow one particular line of thought towards computationally efficient high quality methods. We favor time scaling based on time-frequency representations over model based approaches, and proceed to review an iterative phase reconstruction method for time-scaled magnitude spectrograms. The search for a good initial phase estimate leads us to consider synchro...
متن کاملEpoch-Synchronous Overlap-Add (ESOLA) for Time- and Pitch-Scale Modification of Speech Signals
Timeand pitch-scale modifications of speech signals find important applications in speech synthesis, playback systems, voice conversion, learning/hearing aids, etc.. There is a requirement for computationally efficient and real-time implementable algorithms. In this paper, we propose a high quality and computationally efficient timeand pitch-scaling methodology based on the glottal closure inst...
متن کاملAn Overlap-add Technique Based on Waveform Similarity (wsola) for High Quality Time-scale Modification of Speech
A concept of waveform similarity is proposed for tackling the problem of time-scale modification of speech, and is worked-out in the context of short-time Fourier transform representations. The resulting WSOLA algorithm produces high quality speech output, is algorithmically and computationally efficient and robust, and allows for on-line processing with arbitrary timescaling factors that may b...
متن کاملCHAPTER 15 Time - Domain and Frequency - Domain Techniques for Prosodic Modification of Speech
1. Introductjon 2 General consjderatjons on tjrne-scaling and pjtch-scaling 2.1. Asjrnplemodelforvojcedspeech 2 Tjrne-scalernodificatjon 3 Pjtchl r ifi tj 4 ossjble approaches to prosodic modificatjon 3. The short tjrne Fourjer transforrn and overlap-add synthesjs 3.1. naly js 2 Modifi tjo . 3 Sy th sjs 4. im -scalingtechniques 4 OLAt m -scaling 4.2. y chroniz dOLA rne-scaling 3 WSOLA: An overl...
متن کاملWaveform similarity based overlap-add (WSOLA) for time-scale modification of speech: structures and evaluation
A synchronization criterion for overlap-add time-scale modification is derived through a least squares estimation of the modified short-time Fourier transform. Based on this finding, a structural time-domain framework for time-scale modification is described. One efficient variant, which was called the Waveform Similarity based Overlap-Add (WSOLA) method, produces high quality output when appli...
متن کامل